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CLAIM OF PRIORITY 
[0001] This application claims priority from U.S. provisional patent 
Application No. 60/229,662, entitled "REDUCTION OF META DATA IN A 
NETWORK," filed August 31, 2000, incorporated herein by reference. 

5 

Field of the Invention 

[0002] The invention relates to the transfer of information over 
limited bandwidth networks, particularly, wireless networks, and specifically 
to the transfer of information intended for display on wireless Internet 
10 devices. 

BACKGROUND 

[0003] Currently, a number of different technologies seek to provide 
the benefits of the Internet, and the World Wide Web in particular, to users 
15 any time and anywhere they desire it through wireless technologies. 
However, both the wireless devices used to access the Internet and the 
networks which carry information to those devices have a number of 
limitations. 
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[0004] Wireless devices are significantly smaller and less powerful 
than desktop or laptop devices traditionally used to access the World Wide 
Web via a web browser. In addition, the wireless networks which connect 
these wireless devices to the Internet do not have the same bandwidth as 
5 land-based "wire line" systems, and provide this limited bandwidth at a 
higher cost, more limited availability and lower quality of service when 
compared with land-based systems. 

[0005] One attempt to address the limitations mentioned above is 
utilized by the Palm VII® wireless personal assistant and is known as 

10 "clipping." In the Palm® system, a customized wireless application is 
written and deployed in two parts: a Web-based back-end, which serves 
dynamic content, and a Palm query application (PQA), which resides on 
the Palm VII® itself. The disadvantage of the Palm system is that its 
proprietary nature requires that only certain types of pages be available to 

15 the Palm device. In effect, the PQA takes a "clipping" from the website 
with which it is permanently associated. This means that for each site, the 
content distributor must write and distribute a PQA that contains all the 
menus and forms needed to input and output data on that site. This 
requires hosting extra files to build the clippings on the fly using any 

20 number of standard blank applications such as cold fusion, active server 
pages, or CGI. The advantage of clipping is that it keeps costly wireless 
transactions to a minimum. Only the information that needs to be updated 
is sent over the radio network. The downside is that all interactions with 
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the website must be planned ahead of time and introduction of new 
material requires writing and compiling a new version of the PQA. Nor can 
users freely browse a large site on their own to find what they need. 
[0006] One wireless application solution which is gaining popularity 

5 is wireless application protocol (WAP). WAP is a standard for bringing 
together wireless telephones and Internet content services regardless of 
the wireless network architecture or device type. WAP is designed to work 
with any type of underlying wireless network architecture, thereby freeing 
the provider to concentrate on the wireless application itself. 

10 [0007] As shown in Figure 1 , the WAP model presupposes a user 
agent 10, such as a cellular telephone or personal digital assistant (PDA), 
which is equipped with a micro browser. The WAP client 1 0 communicates 
directly with a server on the Internet 25 via a WAP gateway 20 as shown 
in Figure 1 . The WAP gateway server sits between a wireless carrier's 

15 network 15 on one side and the public Internet 25 on the other. (This 
configuration need not be limited to the public Internet, but may include 
private Intranets, so that gateways can be located within the carrier or 
corporate firewalls or both.) The WAP gateway 20 handles the interface 
between the two sets of network protocols, wireless WAP and wireline 

20 TCP/IP. The WAP gateway 20 decodes and decompresses wireless 
terminal requests and sends it on to the appropriate web server as an 
ordinary HTTP request. 

[0008] Certain wireless carriers have already implemented WAP 
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gateways. If a standard HTML document is served in response to an 
HTTP request from a PDA 10, the WAP gateway implements content 
translation before the request can be relayed back to the WAP client 10. 
The WAP gateway 20 also imposes data quantity limits on client 
5 responses. The gateway limitation means that for each given transaction, 
only a limited number of bytes may pass through the gateway. This so- 
called "gateway limit" defines the actual amount of data which may be 
returned in response to an HTTP request. 

[0009] Generally, WAP gateways have some form of limitation on 
10 the amount of data which is transmitted to the client 10. In some cases, 
the gateway limitation is at or about 1492 bytes. Hence, this presents an 
additional problem to content providers to design pages and applications 
which can provide useful content and information to a WAP client 10. 
[0010] Typically, data which is sent by web pages contains both data 
1 5 which the user sees and "meta-data" or information about the data in the 
web page that is used for a number of purposes, that the users generally 
do not see. For example, meta-data can include a number of meta "tags" 
identifying to search engines the type of document which is coded in a 
markup language (HTML, HDML, etc.) in order to allow search engines to 
20 quickly determine the type of document it is and return information on the 
document to users. Other types of lengthy data in web pages may include 
URLs (universal resource locators) which, given the nature of the Internet 
and the World Wide Web at present, can be very long. For example, many 
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Intemet websites today provide e-mail services to clients. A URL for an e- 
mail can be quite long, since the URL contains server information as well 
as user identification information. Such URLs can easily be 200 bytes in 
length, and may be embedded in a document and hence unseen to a user. 
5 [0011] Long URLs can be particularly problematic for portal sites 
which seek to provide the user with a customized home page tailored to 
the user's own design. Such sites allow the user to define the user's 
specific interests and activities such as mail, local information including 
weather, movie information, directions, etc., or other personalized 

1 0 information that the user wishes to see when they log on to their personal 
site. In such a case, links to each of the aforementioned types of 
information must provide, in the link, specific identification information 
regarding the type of information which the user seeks to retrieve (i.e. 
"provide weather information for San Francisco, CA"). Hence, it is quite 

15 conceivable that a user's customized homepage could have any number 
of long URL links. Translating this page to a wireless portal, given the 
WAP gateway limitations mentioned above, would quickly occupy the 
amount of available bandwidth to transmit the document via gateway to 
user. 

20 [0012] In view of the foregoing, techniques which reduce the 

bandwidth required by such pages and other Internet data are desirable. 
In addition, providing data efficiency without reducing functionality or 
features is desirable. 
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SUMMARY 

[0013] An embodiment of the invention, roughly described, 
comprises a method of encoding non-displayed data in a web page. In one 
aspect, the method comprises mapping data strings in a web page to a 
5 fixed-length string prior to sending the web page to a user; replacing said 
data strings in the web page with the fixed-length strings; and transmitting 
the page with said fixed-length strings. 

[0014] According to another aspect, the invention comprises a 
method of converting meta-data in a web page to a fixed-length string. The 
10 method includes the steps of converting a string of meta-data to a fixed- 
length string; replacing the string of meta-data in the web page with the 
fixed-length string; and transmitting the page. 



BRIEF DESCRIPTION OF THE DRAWINGS 
1 5 [001 5] The invention will be described with respect to the particular 
embodiments thereof. Other objects, features, and advantages of the 
invention will become apparent with reference to the specification and 
drawings in which: 

[0016] Figure 1 is a block diagram of a wireless device coupling to 
20 an Internet view of a WAP gateway; 

[0017] Figure 2 is a block diagram of a network used in accordance 
with an embodiment of the present invention; 

[001 8] Figure 3 is a graphic depicting the URL linking and reduction 
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table mapping of the method of an embodiment of the present invention; 
[0019] Figures 4 and 5 are flow charts representing a first 
embodiment of a method of the present invention for installing and utilizing 
fixed-length URLs in web pages; 
5 [0020] Figures 6 and 7 are flow charts representing a second 
embodiment utilizing a two-way hash to incorporate fixed-length URLs in 
web pages in accordance with the present invention. 

DETAILED DESCRIPTION 

10 [0021] Embodiments of the present invention seek to reduce the 
amount of data which is transmitted via a wireless network to increase 
efficiency of the network, without reducing the amount of information 
contained in the transmitted information. As a consequence, the perceived 
page size returned in response to an HTTP request is increased and the 

15 perceived speed of the network is improved. In accordance with an 
embodiment of the present invention, data of varying lengths is replaced 
with fixed-length data prior to transmission via the network. Requests for 
and references to the fixed-length data are referenced to the original data 
via an intermediate server and the original data manipulated in accordance 

20 with its intended use. In one embodiment, each URL in a web page which 
is to be transmitted via a gateway is reduced to a fixed length prior to 
transmitting the page. This fixed length, which may be in one example, 20 
bytes, substantially increases the allowable amount of displayable 
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information which may be provided in the page within the gateway 
limitation of the particular system. 

[0022] Figure 2 shows a wireless client device 10 which may be a 
personal digital assistant, a wireless Internet-enabled cellular phone, or any 
other device capable of accessing a wireless network 1 5 which is coupled 
to a WAP gateway 20. Within the public Internet 25, a particular website 
server 50 is coupled via WAP server 55 to provide wireless formatted 
content information to the wireless client 10. 

[0023] In accordance with an embodiment of the invention, the WAP 
server 55 receives content directly from site server 50, and in one 
embodiment, the present invention includes a series of processes enabled 
to process data on the WAP server 55 to perform a method of the 
invention. In one aspect, such processes convert URLs in a given page of 
web content to fixed- length URLs, and maintain a correlation between the 
fixed-length URL and the original URL to enable a user utilizing client 10 
to access data on the site server 50. Such correlation can be a table 
mapping fixed to variable-length data, or a reversible hash with keys 
tracked on a page or data-specific basis. In another embodiment, any 
meta-data string in a page may be converted to a fixed length and 
retrieved. 

[0024] Figure 3 shows a first embodiment of the system and method 
of the present invention in an implementation for reducing data to a fixed 
length. As shown in Figure 3, each page 60 which is provided from site 
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server 50 through WAP server 55 to PDA 10 in response to an HTTP 
request from PDA 10 has a unique identifier 62. The unique identifier 62 
may be, in one embodiment, a time mark in milliseconds, a page identifier, 
and a running integer (time%id%integer). The time in milliseconds maybe 
5 derived from a single epoch, which in one embodiment is the operating 
system's time for the operating system running the processes 
implementing an embodiment of the present invention. Since site server 
50 is capable of processing multiple pages in a single millisecond, the per- 
page ID and user ID as set forth above provide a unique identifier 62 per 

10 page. However, it should be recognized that the unique identifier 62 may 
take many forms and any number of identification schemes to provide a 
unique identifier 62 for page 60 in Figure 3 may be utilized in accordance 
with various embodiments of the present invention. As long as the 
identifier is unique, any format of identification will suffice. 

15 [0025] Mapping table 65 is maintained for each uniquely-identified 
page 60 provided by the processing server, in one case WAP server 55. 
Table 65 shows mapping of URLs, but the mapping may occur for any URL 
or meta-data in a page. Each link in the table represented in column 2 of 
table 65 is, in a first embodiment of the invention, mapped to a specific 

20 random URL identifier shown in column 1 . In this way, the long URLs are 
directly associated with the fixed-length URL and the longer URLs may be 
retrieved at a later time in accordance with a method of the present 
invention. 
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[0026] The random link in column 1 may be generated in any 
number of ways, as long as a fixed-length unique number is provided for 
each link. Given the number of links in even the longest web page, a 20 
byte link is sufficient to ensure unique randomness for each table 65 
5 associated with a unique page 60. 

[0027] Each actual link in page 60 (Link 1 , Link 2, Link 3...) is then 

replaced with its corresponding fixed-length link (RanLink 1 , RanLink 2..., 
etc.) to generate a shortened, mapped page 70. Each random link 
incorporates the page ID and the randomly-generated number associated 

10 with the specific link it replaces. 

[0028] Figure 4 represents a first embodiment of a method shown 

in Figure 3 whereby a web page incorporating a number of long meta-data 
strings are replaced with fixed-length strings in accordance with an 
embodiment of the present invention. Again, the method is described, for 

15 example purposes only, in the context of URLs. 

[0029] As one who is skilled in the art would appreciate, Figures 4-7 
illustrate logic boxes for performing specific functions. In alternative 
embodiments, more or fewer logic boxes may be used. In an embodiment 
of the present invention, a logic box may represent a software program, a 

20 software object, a software function, a software subroutine, a software 
method, a software instance, a code fragment, a hardware operation or 
user operation, singly or in combination. 

[0030] At logic box 100 in Figure 4, a web page 60 is received and 
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a unique page ID, in accordance with the foregoing description, is 
generated. 

[0031] Once a unique ID is generated, at logic box 102, for each 

URL on the page, a fixed random link is generated and the random fixed- 
5 length link is stored with the associated real URL in a table 65 as shown in 
Figure 3. In accordance with the method of the invention shown in Figure 
4, any fixed-length hash, such as the MD5 hash, may be utilized to 
generate the fixed-length URL identifier. MD5 hash is an algorithm which 
is known in the art. MD5 hash is often used to create a digital fingerprint 

10 or "message digest" of an input message. 

[0032] At logic box 1 02, this fixed-length mapping table is stored on 
a process server (WAP server 55, content server 50, gateway server 20, 
or any other process server configured to implement an embodiment of the 
present invention) as shown in Figure 2. Table 65 may be stored for any 

15 length of time, generally two to six hours, within which a user might be 
likely to return to this page and require the reference to the fixed-length 
URL to be transmuted back to the regular URL to retrieve the page. 
[0033] Once table 65 is generated, at logic box 104, a new page 70 
is generated by replacing each of the original URLs on page 60 with the 

20 corresponding fixed-length link. The replaced page is then transmitted at 
logic box 106 to the requesting user in response to the original HTTP 
request for page 60. 

[0034] Once the fixed-length page 70 is received by the user in 

Attorney Docket No.: YAHOO-01010US1 Express Mail No.: EL622696916US 

lharris/yahoo/1 010.001 



place of the original page 60, it is likely that the user will select one or more 
of the links replaced by the method of Figure 4. Figure 5 shows a method 
for processing an HTTP request incorporating a fixed-length link received 
from a client device 10 after it receives a mapped page 70 from logic box 
106 of Figure 4, according to an embodiment of the present invention. 
[0035] Once a URL request 110 is received, the page ID 

incorporated into the link is matched at logic box 112 with the correct link 
table 65 and the specific fixed-length link is matched to the original URL at 
logic box 1 14. Next, at logic box 116, the real URL is processed and an 
HTTP request for the page referenced by the real URL is sent, the real 
page retrieved, and the real page processed in accordance with the 
method of Figure 4 (or the alternative method of Figure 6, described 
below). 

[0036] Figures 6 and 7 show an alternative embodiment of the 
present invention wherein the tables 65 shown in Figure 3 need not be 
utilized. As shown in Figure 6, a web page with a number of URL links 
may be processed using a two-way hash algorithm, with the server 
retaining on a page-wise basis a key to reverse the hashing algorithm and 
thereby return the original URL. In accordance with this aspect of the 
present invention, a unique page ID is generated at logic box 100a, and at 
logic box 1 02a, for each URL on the page, a fixed random link is generated 
and a key for reversing the hashing algorithm is stored. In an embodiment, 
the key may be generated using MD5 hash, or may be any randomly- 
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generated number which is unique. 

[0037] Next, at logic box 104a, each random link is inserted into a 
new, mapped page which is sent at logic box 106a in response to the 
original HTTP request for page 75. 

[0038] In this embodiment, the table 65 of Figure 3 need not be 
generated or stored, but the key for reversing the hash is stored on a page- 
wise, link-wise or other level, in order to retrieve the original URL from the 
random link on page 60. The key for the hash may be retained on a page- 
by-page basis. 

[0039] When a URL request from a page incorporating the fixed- 
length URL 1 10a is received, as shown in Figure 7, once again the page 
ID is matched at logic box 1 12a, and at logic box 1 14a a reverse hash is 
performed to determine the original URL based on the key associated with 
the page. Once again, following generation and acquisition of the original 
URL from page 60, the URL is processed at logic box 116a and the 
method may return to processing the next HTML page. 
[0040] In accordance with an embodiment of the present invention, 
the amount of data per page transmitted to wireless devices via a wireless 
network is reduced without lessening the amount of information which is 
available to the user, and a cost savings in bandwidth is provided. The 
invention may be applied to any string of data in a web page which is not 
generally displayed to a user, however, the invention could be applied to 
reduce visible strings of data as well. For example, long URLs may be 
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reduced to a fixed byte length, providing the added benefit that the URL is 
now encrypted in transmission. Since in general, users of wireless 
networks pay premiums for the amount of bandwidth utilized and the 
amount of time the user is coupled to the wireless network, users see an 

5 overall cost savings. Thus, reducing the amount of data transmitted per 
unit of bandwidth increases value to the user. In addition, while the actual 
speed of the network is not improved, the perceived speed of the network 
is improved since the users retrieve more content per unit of bandwidth 
and less URL space is required in each transaction. 

10 [0041] While certain specific gateway data limitations have been 
herein mentioned, it should be recognized that any particular gateway 
limitation may be utilized and the specific values of 1492 bytes for the 
gateway limitation, 20 bytes for the fixed-length link, and the like, are 
exemplary only. 

15 [0042] The aforementioned benefits of perceived speed and 
reduction of data per unit of bandwidth while providing the same content 
per unit of bandwidth are achievable in non-wireless networks as well. The 
system of the present invention also increases the perceived time that a 
page is rendered since the processing occurring on the back end is 

20 relatively quick and much faster than the transmission latencies involved 
in processing the page transmission via a wireless network. 
[0043] In addition, the invention is not limited to use with links or 

URLs. Any meta-data stream can be reduced prior to transmission of the 
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stream to the user, since a user on a wireless client would likely have little 
use for such meta-data. 

[0044] These and other advantages of the present invention will be 
readily apparent to one of average skill in the art. All such features and 
advantages of the invention are intended to be within the scope of the 
invention as illustrated by the written description and the drawings, and 
defined by the attached claims. 
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